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Description 

Technical Field of the Invention 

[0001] The present invention pertains to a method for the screening of compounds for biological activity and to the 
determination of binding dissociation constants using two-dimensional 15 N/ 1 H NMR correlation spectral analysis to 
identify and design ligands that bind to a target biomolecule. 

Background of the Invention 

[0002] One of the most powerful tools for discovering new drug leads is random screening of synthetic chemical and 
natural product databases to discover compounds that bind to a particular target molecule (i.e., the identification of 
ligands of that target). Using this method, ligands may be identified by their ability to form a physical association with 
a target molecule or by their ability to alter a function of a target molecule. 

[0003] When physical binding is sought, a target molecule is typically exposed to one or more compounds suspected 
of being ligands and assays are performed to determine if complexes between the target molecule and one or more 
of those compounds are formed. Such assays, as is well known in the art, test for gross changes in the target molecule 
(e.g., changes in size, charge, mobility) that indicate complex formation. 

[0004] Where functional changes are measured, assay conditions are established that allow for measurement of a 
biological or chemical event related to the target molecule (e.g., enzyme catalyzed reaction, receptor-mediated enzyme 
activation). To identify an alteration, the function of the target molecule is determined before and after exposure to the 
test compounds. 

[0005] Existing physical and functional assays have been used successfully to identify new drug leads for use in 
designing therapeutic compounds. There are, however, limitations inherent to those assays that compromise their 
accuracy, reliability and efficiency. 

[0006] A major shortcoming of existing assays relates to the problem of "false positives'. In a typical functional assay, 
a "false positive" is a compound that triggers the assay but which compound is not effective in eliciting the desired 
physiological response. In a typical physical assay, a "false positive" is a compound that, for example, attaches itself 
to the target but in a non-specific manner (e.g., non-specific binding). False positives are particularly prevalent and 
problematic when screening higher concentrations of putative ligands because many compounds have non-specific 
effects at those concentrations. 

[0007] In a similar fashion, existing assays are plagued by the problem of "false negatives", which result when a 
compound gives a negative response in the assay but which compound is actually a ligand for the target. False neg- 
atives typically occur in assays that use concentrations of test compounds that are either too high (resulting in toxicity) 
or too low relative to the binding or dissociation constant of the compound to the target. 

[0008] Another major shortcoming of existing assays is the limited amount of information provided by the assay itself. 
While the assay may correctly identify compounds that attach to or elicit a response from the target molecule, those 
assays typically do not provide any information about either specific binding sites on the target molecule or structure 
activity relationships between the compound being tested and the target molecule. The inability to provide any such 
information is particularly problematic where the screening assay is being used to identify leads for further study. 
[0009] It has recently been suggested that X-ray crystallography can be used to identify the binding sites of organic 
solvents on macromolecules. However, this method cannot determine the relative binding affinities at different sites 
on the targeL It is only applicable to very stable target proteins that do not denature in the presence of high concen- 
trations of organic solvents. Moreover, this approach is not a screening method for rapidly testing many compounds 
that are chemically diverse, but is limited to mapping the binding sites of only a few organic solvents due to the long 
time needed to determine the individual crystal structures. 

[001 0] Compounds are screened to identify leads that can be used in the design of new drugs that alter the function 
of the target biomolecule. Those new drugs can be structural analogs of identified leads or can be conjugates of one 
or more such lead compounds. Because of the problems inherent to existing screening methods, those methods are 
often of little help in designing new drugs. 

[0011] There continues to be a need to provide new, rapid, efficient, accurate and reliable means of screening com- 
pounds to identify and design ligands that specifically bind to a particular target. 

Brief Summary of the Invention 

[0012] In one aspect, the present invention provides a process of screening compounds for biological activity to 
identify ligands that bind to a specific target molecule. That process comprises the steps of: a) generating a first two- 
dimensional ^N^H NMR correlation spectrum of a 15 N-labeled target molecule; b) exposing the labeled target mole- 
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cule to one or a mixture of chemical compounds; c) generating a second two-dimensional * S NPH NMR correlation 
spectrum of the labeled target molecule that has been exposed to one or a mixture of compounds in step (b); and d) 
comparing said first and second two-dimensional 15 N/ 1 H NMR correlation spectra to determine differences between 
said first and said second spectra, the differences identifying the presence of one or more compounds that are ligands 

s which have bound to the target molecule. 

[0013] Where the process of the present invention screens more than one compound in step (b), that is, a mixture 
of compounds, and where a difference between the first spectrum generated from the target molecule alone and that 
generated from the target molecule in the presence of the mixture, additional steps are performed to identify which 
specific compound or compounds contained in the mixture is binding to the target molecule. Those additional steps 

io comprise the steps of e) exposing the 15 N-labeled target molecule individually to each compound of the mixture, f) 
generating a two-dimensional 15 N/ 1 H NMR correlation spectrum of the labeled target molecule that has been individ- 
ually exposed to each compound; and g) comparing each spectrum generated in step f) to the first spectrum generated 
from the target molecule alone to determine differences in any of those compared spectra, the differences identifying 
the presence of a compound that is a ligand which has bound to the target molecule. 

is [0014] Because the chemical shift values of the particular ^NA'H signals in the two-dimensional correlation spectrum 
correspond to known specific locations of atomic groupings in the target molecule (e.g., the N-H atoms of the amide 
or peptide link of a particular amino acid residue in a polypeptide), the process of the present invention allows not only 
for the for identification of which compound(s) bind to a particular target molecule, but also permit the determination 
of the particular binding site of the ligand on the target molecule. 

20 [0015] In a second aspect, the present invention provides a process of determining the dissociation constant, Kq, 
for a given ligand and its target molecule. That process comprises the steps of a) generating a first two- 
dimensional 15 N/ 1 H NMR correlation spectrum of a 15 N-labeled target molecule; b) exposing the labeled target mole- 
cule to various concentrations of a ligand; c) generating a two-dimensional ^N^H NMR correlation spectrum at each 
concentration of ligand in step (b); d) comparing each spectrum from step (c) to the first spectrum from step (a); and 

25 e) calculating the dissociation constant between the target molecule and the ligand from those differences according 
to the equation: 

„ ([P] o -x)([L] 0 -x) 



[0016] An advantageous aspect of the present invention is the capability of the process of the present invention to 
determine the dissociation constant of one ligand of the target molecule in the presence of a second molecule already 
bound to the ligand. This is generally not possible with prior art methods which employ "wet chemical" analytical meth- 
35 ods of determining binding of a ligand to a target molecule substrate. 

[0017] In this preferred embodiment, the process of determining the dissociation constant of a ligand can be per- 
formed in the presence of a second bound ligand. In accordance with this embodiment, the 15 N-labeled target molecule 
is bound to that second ligand before exposing that target to the test compounds. 

[0018] The ability of the present method to determine not only the existence of binding between one ligand and the 
40 target molecule, but also the particular site of binding in the presence of a second bound ligand permits the capability 
to design a drug that comprises two or more linked moieties made up of the ligands. 

[0019] This method uses the two-dimensional 15 N/ 1 H NMR correlation spectroscopic screening process as set forth 
above to identify a first and subsequent ligands that bind to the target molecule. A complex of the target molecule and 
two or more ligands is formed and the three-dimensional structure of that complex is determined preferably using NMR 
45 spectroscopy or X-ray crystallography. That three-dimensional structure is used to determine the spatial orientation of 
the ligands relative to each other and to the target molecule. 

[0020] Based on the spatial orientation, the ligands are linked together to form the drug. The selection of an appro- 
priate linking group is made by maintaining the spatial orientation of the ligands to one another and to the target 
molecule based upon principles of bond angle and bond length information well known in the organic chemical art. 

so [0021] Thus, the molecular design method comprises identifying a first ligand moiety to the target molecule using 
two-dimensional ^NA'H NMR correlation spectroscopy; identifying subsequent ligand moieties to the target molecule 
using two-dimensional ^N^H NMR correlation spectroscopy; forming a complex of the first and subsequent ligand 
moieties to the target molecule; determining the three dimensional structure of the complex and, thus, the spatial 
orientation of the first and subsequent ligand moieties on the target molecule; and linking the first and subsequent 

55 ligand moieties to form the drug to maintain the spatial orientation of the ligand moieties. 

[0022] The identification of subsequent ligand moieities can be performed in the absence or presence of the first 
ligand (e.g., the target molecule can be bound to the first ligand before being exposed to the test compounds for 
identification of the second ligand). 
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[0023] In a preferred embodiment, the target molecule used in a screening or design process is a polypeptide. The 
polypeptide target is preferably produced in recombinant form from a host cell transformed with an expression vector 
that contains a polynucleotide that encodes the polypeptide, by culturing the transformed host cell in a medium that 
contains an assimilable source of 15 N such that the recombinants produced polypeptide is labeled with 15 N, 

Brief Description of the Drawings 

[0024] In the drawings which form a portion of the specification: 

FIG. 1 shows a 15 N/ 1 H correlation spectrum of the DNA binding domain of uniformly 15 N-labeled human papillo- 
mavirus E2. The spectrum (80 complex points, 4 scans/fid) was acquired on a 0.5 mM sample of E2 in 20 
mM phosphate (pH 6.5), 10 mM dithiothreitol (DTT) and 10% deuterium oxide (D 2 0). 

FIG. 2 shows 15 N/ 1 H correlation spectra of the DNA binding domain of uniformly 15 N-labeled human papillomavirus 
E2 before (thin multiple contours) and after (thick single contours) addition of a final test compound. The 
final concentration of compound was 1.0 mM. All other conditions are as stated in FIG. 1. Selected residues 
that show significant changes upon binding are indicated. 

FIG. 3 shows 15 N/ 1 H correlation spectra of the DNA binding domain of uniformly 15 N-labeled human papillomavirus 
E2 before (thin multiple contours) and after (thick single contours) addition of a second test compound. The 
final concentration of compound was 1 .0 mM. All other conditions are as stated in FIG. 1 . Selected residues 
that show significant changes upon binding are indicated. 

FIG. 4 shows ^N^H correlation spectra of the catalytic domain of uniformly 15 N-labeled stromelysin before (thin 
multiple contours) and after (thick single contours) addition of a test compound. The final concentration of 
compound was 1 .0 mM. The spectra (80 complex points, 8 scans/fid) were acquired on a 0.3 mM sample 
of SCD in 20 mM TRIS (pH 7.0), 20 mM CaCI 2 and 10% D 2 0. Selected residues that show significant 
changes upon binding are indicated. 

FIG. 5 shows 15 N/ 1 H correlation spectra of the Ras-binding domain of uniformly 15 N-labeled RAF peptide (residues 
55-132) before (thin multiple contours) and after (thick single contours) addition of a test compound. The 
final concentration of compound was 1 .0 mM. The spectra (80 complex points, 8 scans/fid) were acquired 
on a 0.3 mM sample of the RAF fragment in 20 mM phosphate (pH 7.0), 1 0 mM DTT and 1 0% D 2 0. Selected 
residues that show significant changes upon binding are indicated. 

FIG. 6 shows 15 N/ 1 H correlation spectra of uniformly 15 N-labeled FKBP before (thin multiple contours) and after 
(thick single contours) addition of a test compound. The final concentration of compound was 1 .0 mM. The 
spectra (80 complex points, 4 scans/fid) was acquired on a 0.3 mM sample of FKBP in 50 mM phosphate 
(pH 6.5), 100 mM NaCI and 10% D 2 0. Selected residues that show significant changes upon binding are 
indicated. 

FIG. 7 shows a first depiction of the NMR-derived structure of the DNA-binding domain of E2. The two monomers 
of the symmetric dimer are oriented in a top-bottom fashion, and the N- and C-termini of each monomer 
are indicated (N and C for one monomer, N* and C* for the other). Shown in ribbons are the residues which 
exhibit significant chemical shift changes (A5( 1 H)>0.04 ppm; A5( 15 N) >0.1 ppm) upon binding to a first test 
compound. These residues correspond to the DNA-recognition helix of E2. Selected residues are numbered 
for aid in visualization. 

FIG. 8 shows a second depiction of the NMR-derived structure of the DNA-binding domain of E2. The two mono- 
mers of the symmetric dimer are oriented in a top-bottom fashion, and the N-and C-termini of each monomer 
are indicated (N and C for one monomer, N # and C* for the other). Shown in ribbons are the residues which 
exhibit significant chemical shift changes (A5( 1 H)>0.04 ppm; A5( 15 N) >0.1 ppm) upon binding to a second 
test compound. These residues are located primarily in the dimer interface region. Selected residues are 
numbered for aid in visualization. 

FIG. 9 shows a depiction of the NMR-derived structure of the catalytic domain of stromelysin. The N- and C-termini 
are indicated. Shown in ribbons are the residues which exhibit significant chemical shift changes (A8( 1 H) 
>0.04 ppm; A8( 15 N) >0.1 ppm) upon binding to a test compound. These either form part of the SV binding 
site or are spatially proximal to this site. Selected residues are numbered for aid in visualization. 

FIG. 10 shows a ribbon plot of a ternary complex of first and second ligands bound to the catalytic domain of strome- 
lysin. 

Detailed Description of the Invention 

[0O2S] The present invention provides a rapid and efficient screening method for identifying ligands that bind to 
therapeutic target molecules. 
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[0026] Ligands are identified by testing the binding of molecules to a target molecule (e.g., protein, nucleic acid, etc.) 
by following, with nuclear magnetic resonance (NMR) spectroscopy, the changes in chemical shifts of the target mol- 
ecule upon the addition of the ligand compounds in the database. 

[0027] From an analysis of the chemical shift changes of the target molecule as a function of ligand concentration, 
5 the binding affinities of ligands for biomolecules are also determined. 

[0028] The location of the binding site for each ligand is determined from an analysis of the chemical shifts of the 
biomolecule that change upon the addition of the ligand and from nuclear Overhauser effects (NOES) between the 
ligand and biomolecule. 

[0029] Information about the structure/activity relationships between ligands identified by such a process can then 
10 be used to design new drugs that serve as ligands to the target molecule. By way of example, where two or more 
ligands to a given target molecule are identified, a complex of those ligands and the target molecule is formed. The 
spatial orientation of the ligands to each other as well as to the target molecule is derived from the three-dimensional 
structure. That spatial orientation defines the distance between the binding sites of the two ligands and the orientation 
of each ligand to those sites. 

*s [0030] Using that spatial orientation data, the two or more ligands are then linked together to form a new ligand. 
Linking is accomplished in a manner that maintains the spatial orientation of the ligands to one another and to the 
target molecule. 

[0031] There are numerous advantages to the NMR-based discovery process of the present invention. First, because 
a process of the present invention identifies ligands by directly measuring binding to the target molecule, the problem 
20 of false positives is significantly reduced. Because the present process identifies specific binding sites to the target 
molecule, the problem of false positives resulting from the non-specific binding of compounds to the target molecule 
at high concentrations is eliminated. 

[0032] Second, the problem of false negatives is significantly reduced because the present process can identify 
compounds that specifically bind to the target molecule with a wide range of dissociation constants. The dissociation 
25 or binding constant for compounds can actually be determined with the present process. 

[0033] Other advantages of the present invention result from the variety and detailed data provided about each ligand 
from the discovery process. 

[0034] Because the location of the bound ligand can be determined from an analysts of the chemical shifts of the 
target molecule that change upon the addition of the ligand and from nuclear Overhauser effects (NOEs) between the 
so ligand and biomolecule, the binding of a second ligand can be measured in the presence of a first ligand that is already 
bound to the target. The ability to simultaneously identify binding sites of different ligands allows a skilled artisan to 1 ) 
define negative and positive cooperative binding between ligands and 2) design new drugs by linking two or more 
ligands into a single compound while maintaining a proper orientation of the ligands to one another and to their binding 
sites. 

35 [0035] Further, if multiple binding sites exist, the relative affinity of individual binding moieties for the different binding 
sites can be measured from an analysis of the chemical shift changes of the target molecule as a function of the added 
concentration of the ligand. By simultaneously screening numerous structural analogs of a given compound, detailed 
structure/activity relationships about ligands is provided. 

[0036] In its principal aspect, the present invention provides a process of screening compounds to identify ligands 
40 that bind to a specific target molecule. That process comprises the steps of: a) generating a first two-dimensional 1 S HP H 
NMR correlation spectrum of a 15 N-labeled target molecule; b) exposing the labeled target molecule to one or more 
compounds; c) generating a second two-dimensional ^N/ 1 H NMR correlation spectrum of the labeled target molecule 
that has been exposed to the compounds of step (b); and d) comparing the first and second spectra to determine 
whether differences in those two spectra exist, which differences indicate the presence of one or more ligands that 
45 have bound to the target molecule. 

[0037] Where a process of the present invention screens more than one compound in step (b) and where a difference 
between spectra is observed, additional steps are performed to identify which specific compound is binding to the 
target molecules. Those additional steps comprise generating a two-dimensional * S NPH NMR correlation spectrum 
for each individual compound and comparing each spectrum to the first spectrum to determine whether differences in 
50 any of those compared spectra exist, which differences indicate the presence of a ligand that has bound to the target 
molecule. 

[0038] Any 15 N-labeled target molecule can be used in a process of the present invention. Because of the importance 
of proteins in medicinal chemistry, a preferred target molecule is a polypeptide. The target molecule can be labeled 
with 15 N using any means well known in the art. In a preferred embodiment, the target molecule is prepared in recom- 
55 binant form using transformed host cells. In an especially preferred embodiment, the target molecule is a polypeptide. 
Any polypeptide that gives a high resolultion NMR spectrum and can be partially or uniformly labeled with 15 N can be 
used. The preparation of uniformly 15 N-labeled exemplary polypeptide target molecules is set forth hereinafter in the 
Examples. 
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[0039] A preferred means of preparing adequate quantities of uniformly l5 N-labeled polypeptides is to transform a 
host celt with an expression vector that contains a polynucleotide that encodes that polypeptide and culture the trans- 
formed cell in a culture medium that contains assimilable sources of 15 N. Assimilable sources of 15 N are well known 
in the art. A preferred such source is 15 NH 4 CI. 
5 [0040] Means for preparing expression vectors that contain polynucleotides encoding specific polypeptides are well 
known in the art. In a similar manner, means for transforming host cells with those vectors and means for culturing 
those transformed cells so that the polypeptide is expressed are also well known in the art. 

[0041] The screening process of the present invention begins with the generation or acquisition of a two- 
dimensional 15 N/ 1 H correlation spectrum of the labeled target molecule. Means for generating two-dimensional 15 N/ 1 H 
w correlation spectra are well known in the art (see, e.g., D. A. Egan et al.. Biochemistry , 32(8): 1920-1927 (1993); Bax, 
A., Grzesiek, S., Acc. Chem. Res., 26(4): 131-138 (1993)). 

[0042] The NMR spectra that are typically recorded in the screening procedure of the present invention are two- 
dimensional 15 N/ 1 H heteronuclear single quantum correlation (HSQC) spectra. Because the^N^H signals corre- 
sponding to the backbone amides of the proteins are usually well-resolved, the chemical shift changes for the individual 

15 amides are readily monitored. 

[0043] In generating such spectra, the large water signal is suppressed by spoiling gradients. To facilitate the acqui- 
sition of NMR data on a large number of compounds (e.g., a database of synthetic or naturally occurring small organic 
compounds), a sample changer is employed. Using the sample changer, a total of 60 samples can be run unattended. 
Thus, using the typical acquisition parameters (4 scans per free induction decay (fid), 100-120 HSQC spectra can be 

20 acquired in a 24 hour period. 

[0044] To facilitate processing of the NMR data, computer programs are used to transfer and automatically process 
the multiple two-dimensional NMR data sets, including a routine to automatically phase the two-dimensional NMR data. 
The analysis of the data can be facilitated by formatting the data so that the individual HSQC spectra are rapidly viewed 
and compared to the HSQC spectrum of the control sample containing only the vehicle for the added compound (DM- 

25 SO), but no added compound. Detailed descriptions of means of generating such two-dimensional 1S N/ 1 H correlation 
spectra are set forth hereinafter in the Examples. 

[0045] A representative two-dimensional ^N^H NMR correlation spectrum of an 15 N-labeled target molecule 
(polypeptide) is shown in FIG. 1 (the DNA-binding domain of the E2 protein). 

[0046] Following acquisition of the first spectrum, the labeled target molecule is exposed to one or more test com- 
30 pounds. Where more than one test compound is to be tested simultaneously, it is preferred to use a database of 
compounds such as a plurality of small molecules. Such molecules are typically dissolved in perdeuterated dimethyl- 
sulfoxide. The compounds in the database can be purchased from vendors or created according to desired needs. 
[0047] Individual compounds can be selected inter alia on the basis of size (molecular weight = 100-300) and mo- 
lecular diversity. Compounds in the collection can have different shapes (e.g., flat aromatic rings(s), puckered aliphatic 
35. rings(s), straight and branched chain aliphatics with single, double, or triple bonds) and diverse functional groups (e. 
- g., carboxylic acids, esters, ethers, amines, aldehydes, ketones, and various heterocyclic rings) for maximizing the 
possibility of discovering compounds that interact with widely diverse binding sites. 

[0048] The NMR screening process of the present invention utilizes ligand concentrations ranging from about 0.1 to 
about 10.0 mM. At these concentrations, compounds which are acidic or basic can significantly change the pH of 
40 buffered protein solutions. Chemical shifts are sensitive to pH changes as well as direct binding interactions, and false 
positive' 1 chemical shift changes, which are not the result of ligand binding but of changes in pH, can therefore be 
observed. It is thus necessary to ensure that the pH of the buffered solution does not change upon addition of the 
ligand. One means of controlling pH is set forth below. 

[0049] Compounds are stored at -10°C (263° K) as 1 .0 and 0. 1 M stock solutions in dimethylsulfoxide (DMSO). This 
45 is necessary because of the limited solubility of the ligands in aqueous solution. It is not possible to directly adjust the 
pH of the DMSO solution. In addition, HCI and NaOH form insoluble salts in DMSO, so alternative acids and bases 
must be used. The following approach has been found to result in stable pH. 

[0050] The 1.0 M stock solutions in DMSO are diluted 1:10 in 50 mM phosphate, pH 7.0. The pH of that diluted 
aliquot solution is measured. If the pH of the aliquot is unchanged (i.e., remains at 7.0), a working solution is made by 

50 diluting the DMSO stock solution 1:10 to make a 0.1 M solution and that solution is stored. 

[0051] If the pH of the diluted aliquot is less than 7.0, ethanolamine is added to the 1 .0 M stock DMSO solution, that 
stock solution is then diluted 1 :10 with phosphate buffer to make another aliquot, and the pH of the aliquot rechecked. 
[0052] If the pH of the diluted aliquot is greater than 7.0, acetic acid is added to the 1 .0 M stock DMSO solution, that 
stock solution is then diluted 1:10 with phosphate buffer to make another aliquot, and the pH of the aliquot rechecked. 

55 [0053] Ethanolamine and acetic acid are soluble in DMSO, and the proper equivalents are added to ensure that upon 
transfer to aqueous buffer, the pH is unchanged. Adjusting the pH is an interactive process, repeated until the desired 
result is obtained. 

[0054] Note that this procedure is performed on 1:10 dilutions of 1 .0 M stock solutions (100 mM ligand) to ensure 
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that no pH changes are observed at the lower concentrations used in the experiments (0.1 to 10 mM) or in different/ 
weaker buffer systems. 

[00S5] Following exposure of the 15 N-labeled target molecule to one or more test compounds, a second two- 
dimensional 15 N/ 1 H NMR correlation spectrum is generated. That second spectrum is generated in the same manner 
5 as set forth above. The first and second spectra are then compared to determine whether there are any differences 
between the two spectra. Differences in the two-dimensional 15 N/ 1 H NMR correlation spectra that indicate the presence 
of a ligand correspond to 15 N-labeled sites in the target molecule. Those differences are determined using standard 
procedures well known in the art. 

[0056] By way of example, FIGS. 2, 3, 4, 5 and 6 show comparisons of correlation spectra before and after exposure 
70 of various target molecules to various test compounds. A detailed description of how these studies were performed 
can be found hereinafter in Examples 2 and 3. 

[0057] Particular signals in a two-dimensional 15 N/ 1 H correlation spectrum correspond to specific nitrogen and proton 
atoms in the target molecule (e.g., particular amides of the amino acid residues in the protein). By way of example, it 
can be seen from FIG. 2 that chemical shifts in a two-dimensional A5 NP H correlation of the DNA-binding domain of E2 
is exposed to a test compound occurred at residue positions 15 (115), 21 (Y21), 22 (R22) and 23 (L23). 

[0058] It can be seen from FIG. 2 that the binding of the ligand involved the isoleucine (He) residue at position 15, 
the tyrosine (Tyr) residue at position 21, the arginine (Arg) residue at position 22 and the leucine (Leu) residue at 
position 23. Thus, a process of the present invention can also be used to identify the specific binding site between a 
ligand and target molecule. 

20 [0059] The region of the protein that is responsible for binding to the individual compounds is identified from the 
particular amide signals that change upon the addition of the compounds. These signals are assigned to the individual 
amide groups of the protein by standard procedures using a variety of well-established heteronuclear multi-dimensional 
NMR experiments. 

[0060] To discover molecules that bind more tightly to the protein, molecules are selected for testing based on the 

25 structure/activity relationships from the initial screen and/or structural information on the initial leads when bound to 
the protein. By way of example, the initial screening may result in the identification of ligands, all of which contain an 
aromatic ring. The second round of screening would then use other aromatic molecules as the test compounds. 
[0061] As set forth hereinafter in Example 2, an initial screening assay for binding to the catalytic domain of strome- 
lysin identified two biaryl compounds as ligands. The second round of screening thus used a series of biaryl derivatives 

30 as the test compounds. 

[0062] The second set of test compounds are initially screened at a concentration of 1 mM, and binding constants 
are measured for those that show affinity. Best leads that bind to the protein are then compared to the results obtained 
in a functional assay. Those compounds that are suitable leads are chemically modified to produce analogs with the 
goal of discovering a new pharmaceutical agent. 

35 [0063] In another aspect, the present invention provides a process for determining the dissociation constant between 
a target molecule and a ligand that binds to that target molecule. That process comprises the steps of: a) generating 
a first two-dimensional 15 N/ 1 H NMR correlation spectrum of a 15 N- labeled target molecule; b) titrating the labeled target 
molecule with various concentrations of a ligand; c) generating a two-dimensional 15 N/ n H NMR correlation spectrum 
at each concentration of ligand from step (b); d) comparing each spectrum from step (c) both to the first spectrum from 

40 step (a) and to all other spectra from step (c) to quantify differences in those spectra as a function of changes in ligand 
concentration; and e) calculating the dissociation constant (K D ) between the target molecule and the ligand from those 
differences. 

[0064] Because of their importance in medicinal chemistry, a preferred target molecule for use in such a process is 
a polypeptide. In one preferred embodiment, a process of determining the dissociation constant of a ligand can be 
45 performed in the presence of a second ligand. In accordance with this embodiment, the 15 N-labeled target molecule 
is bound to that second ligand before exposing that target to the test compounds. 

[0065] Binding or dissociation constants are measured by following the 15 N/ 1 H chemical shifts of the protein as a 
function of ligand concentration. A known concentration ([P] 0 ) of the target moleule is mixed with a known concentration 
([L] 0 ) of a previously identified ligand and the two-dimensional 15 N/ 1 H correlation spectrum was acquired. From this 
50 spectrum, observed chemical shift values (8 0bs ) are obtained. The process is repeated for varying concentrations of 
the ligand to the point of saturation of the target molecule, when possible, in which case the limiting chemical shift 
value for saturation (6^,) is measured. 

[0066] In those situations where saturation of the target molecule is achieved, the dissociation constant for the binding 
of a particular ligand to the targer molecule is calculated using the formula: 

55 

„ gp] 0 -x)([L] 0 -x) 
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where [P] 0 is the total molar concentration of target molecule; [L] 0 is the total molar concentration of ligand; and x is 
the molar concentration of the bound species. The value of x is determined from the equation: 

5 v - 5 <*s " § free 

A 

where 5 free is the chemical shift of the free species; 8 obs is the observed chemical shift; and A is the difference between 
the limiting chemical shift value for saturation (S^,) and the chemical shift value of the target molecule free of ligand 

10 (Sfree) 

[0067] The dissociation constant is then determined by varying its value until a best fit to the observed data is obtained 
using standard curve-fitting statistical methods. In the case where 8^ is not directly known, both Kq and 8^ are varied 
and subjected to the same curve-fitting procedure. 

[0068] The use of the process of the present invention to determine the dissociation or binding affinity of various 
is ligands to various target molecules is set forth hereinafter in Examples 2 and 3. 

[0069] Preferred target molecules, means for generating spectra, and means for comparing spectra are the same 
as set forth above. 

[0070] The initial step in the design process is the identification of two or more ligands that bind to the specific target 
molecule. The identification of such ligands is done using two-dimensional 15 N/tH NMR correlation spectroscopy as 
20 set forth above. 

[0071] Once two or more ligands are identified as binding to the target molecule at different sites, a complex between 
the target molecule and ligands is formed. Where there are two ligands, that complex is a ternary complex. Quaternary 
and other complexes are formed where there are three or more ligands. 

[0072] Complexes are formed by mixing the target molecule simultaneously or sequentially with the various ligands 
25 under circumstances that allow those ligands to bind the target. Means for determining those conditions are well known 
in the art. 

[0073] Once that complex is formed, its three-dimensional structure is determined. Any means of determining three- 
dimensional structure can be used. Such methods are well known in the art. Exemplary and preferred methods are 
NMR and X-ray crystallography. The use of three-dimensional double- and triple resonance NMR to determine the 
30 three-dimensional structure of two ligands bound to the catalytic domain of stromelysin is set forth in detail hereinafter 
in Example 4. 

[0074] An analysis of the three-dimensional structure reveals the spatial orientation of the ligands relative to each 
other as well as to the conformation of the target molecule. First, the spatial orientation of each ligand to the target 
molecule allows for identification of those portions of the ligand directly involved in binding (i.e., those portions inter- 
35 acting with the target binding site) and those portions of each ligand that project away from the binding site and which 
portions can be used in subsequent linking procedures. 

[0075] Second, the spatial orientation data is used to map the positions of each ligand relative to each other. In other 
words, discrete distances between the spatially oriented ligands can be calculated. 

[0076] Third, the spatial orientation data also defines the three-dimensional relationships amongst the ligands and 
40 the target. Thus, in addition to calculating the absolute distances between ligands, the angular orientations of those 
ligands can also be determined. 

[0077] Knowledge of the spatial orientations of the ligands and target is then used to select linkers to link two or more 
ligands together into a single entity that contains all of the ligands. The design of the linkers is based on the distances 
and angular orientation needed to maintain each of the ligand portions of the single entity in proper orientation to the 
45 target. 

[0078] The three-dimensional conformation of suitable linkers is well known or readily ascertainable by one of ordi- 
nary skill in the art. While it is theoreticalty possible to link two or more ligands together over any range of distance and 
three-dimensional projection, in practice certain limitations of distance and projection are preferred. In a preferred 
embodiment, ligands are separated by a distance of less than about 1 5 Angstroms (A), more preferably less than about 

so 10 A and, even more preferably less than about 5 A. 

[0079] Once a suitable linker group is identified, the ligands are linked with that linker. Means for linking ligands are 
well known in the art and depend upon the chemical structure of the ligand and the linking group itsetf. Ligands are 
linked to one another using those portions of the ligand not directly involved in binding to the target molecule. 
[0080] A detailed description of the design of a drug that inhibits the proteolytic activity of stromelysin, which drug 

55 was designed using a process of the present invention is set forth hereinafter in Example 4. 

[0081] The following Examples illustrate preferred embodiments of the present invention and are not limiting of the 
specification and claims in any way. 
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Example 1 

Preparation Of Uniformly 15 N-Labeled Target Molecules 
5 A. Stromelysin 

[0082J Human stromelysin is a 447-amino acid protein believed to be involved in proteolytic degradation of cartilage. 
Cartilage proteolysis is believed to result in degradative loss of joint cartilage and the resulting impairment of joint 
function observed in both osteoarthritis and rheumatoid arthritis. The protein possesses a series of domains including 
w N-terminal latent and propetide domains, a C-terminal domain homologous with homopexin, and an internal catalytic 
domain. 

[0083] Studies have shown that removal of the N-terminal prosequence of approximately eighty amino acids occurs 
to convert the proenzyme to the 45 kDa mature enzyme. Furthermore, studies have shown that the C-terminal homopex- 
in homologous domain is not required for proper folding of the catalytic domain or for interaction with an inhibitor. (See, 
is e.g., A. I. Marcv, Biochemistry, 30: 6476-6483 (1991 ). Thus, the 81-256 amino acid residue internal segment of strome- 
lysin was selected as the protein fragment for use in identifying compounds which bind to and have the potential as 
acting as inhibitors of stromelysin. 

[0084] To employ the method of the present invention, it was necessary to prepare the 81-256 fragment (SEQ ID 
NO: 1 ) of stromelysin in which the peptide backbone was isotopically enriched with and 15 N. This was done by inserting 
20 a plasmid which coded for the production of the protein fragment into an E. coli strain and growing the genetically- 
modified bacterial strain in a limiting culture medium enriched with 15 NH 4 CI and 13 C-glucose. 
[0085] The isotopically enriched protein fragment was isolated from the culture medium, purified, and subsequently 
used as the basis for evaluating the binding of test compounds. The procedures for these processes are described 
below. 

25 [0086] Human skin fibroblasts (ATCC No. CRL 1507) were grown and induced using the procedure described by 
Clark et al., Archiv. Biochem. and Biophys., 241 : 36-45 (1 985). Total RNA was isolated from 1 g of cells using a Promega 
RNAgents® Total RNA Isolation System Kit (Cat.# Z5110, Promega Corp., 2800 Woods Hollow Road, Madison, Wl 
53711-5399) following the manufacturer's instructions. A 1 jig portion of the RNA was heat-denatured at 80°C for five 
minutes and then subjected to reverse transcriptase PCR using a GeneAmp® RNA PCR kit (Cat.# N808-0017, Applied 

30 Biosystems/Perkin-Elmer, 761 Main Avenue, Norwalk, CT 06859-0156) following the manufacturer's instructions. 
[0087] Nested PCR was performed using first primers (A) G AAATG AAG AGTC TTCAA (SEQ ID NO:3) and (B) 
GCGTCCCAGGTTCTGGAG (SEQ ID NO:4) and thirty-five cycles of 94°C, two minutes; 45'C, two minutes; and 72°C 
three minutes. This was followed by reamplification with internal primers (C) ATACCATGGCCTATCCAT TGGAT- 
GGAGC (SEQ ID NO:5) and (D) ATAGGATCCTTAGGTCTCAGGGGA GTCAGG (SEQ ID NO:6) using thirty cycles 

35 under the same conditions described immediately above to generate a DNA coding for amino acid residues 1 -256 of 
human stromelysin. 

[0088] The PCR fragment was then cloned into PCR cloning vector pT7Blue(R) (Novagen, Inc., 597 Science Drive, 
Madison, Wl 5371 1 ) according to the manufacturer's instructions. The resulting plasmid was cut with Ncol and BamHI 
and the stromelysin fragment was subcloned into the Novagen expression vector pET3d (Novagen, Inc., 597 Science 

40 Drive, Madison, Wl 53711), again using the manufacturer's instructions. 

[0089] A mature stromelysin expression construct coding for amino acid residues 81 -256 plus an initiating methionine 
was generated from the 1 -256 expression construct by PCR amplification. The resulting PCR fragment was first cloned 
into the Novagen pT7Blue(R) vector and then subcloned into the Novagen pET3d vector, using the manufacturer's 
instructions in the manner described above, to produce plasmid (pETST-83-256). This final plasmid is identical to that 

45 described by Qi-Zhuang et al.. Biochemistry, 31: 11231-11235 (1992) with the exception that the present codes for a 
peptide sequence beginning two amino acids earlier, at position 81 in the sequence of human stromelysin. 
[0090] Plasmid pETST-83-256 was transformed into E. coli strain BL21(DE3)/pLysS (Novagen, Inc., 597 Science 
Drive, Madison, Wl 53711) in accordance with the manufacturer's instructions to generate an expression strain, BL21 
(DE3)/pLysS/pETST-255- 1. 

so [0091] A precutture medium was prepared by dissolving 1 .698 g of Na2HP 4 »7H 2 0, 0.45 g of KH 2 P0 4 , 0.075 g NaCI, 
0.150 g 15 NH 4 CI, 0.300 13 C-glucose, 300 u.Lof 1M aqueous MgS0 4 solution and 15 uXof aqueous CaCI 2 solution in 
150 mL of deionized water. 

[0092] The resulting solution of precutture medium was sterilized and transferred to a sterile 500 mL baffle flask. 
Immediately prior to inoculation of the preculture medium with the bacterial strain, 150 u.L of a solution containing 34 
55 mg/mL of chloramphenicol in 100% ethanol and 1 .5 mL of a solution containing 20 mg/mL of ampicillin were added to 
the flask contents. 

[0093] The flask contents were then inoculated with 1 mL of glycerol stock of genetically-modified E. Coli, strain 
BL21(DE3)/pLysS/pETST-255-1. The flask contents were shaken (225 rpm) at 37°C until an optical density of 0.65 
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was observed. 

[0094] A fermentation nutrient medium was prepared by dissolving 11 3.28 g of Na 2 HP 4 »7H 2 0, 30 g of KH 2 P0 4 , 5 g 
NaCI and 10 mL of 1% DF-60 antrfoam agent in 9604 ml_ of deionized water. This solution was placed in a New 
Brunswick Scientific Micros Fermenter (Edison, NJ) and sterilized at 121°C for 40 minutes. 

s [0095] Immediately prior to inoculation of the fermentation medium, the following pre-sterilized components were 
added to the fermentation vessel contents: 100 mL of a 10% aqueous solution of 1S NH 4 CI, solution of a 10% aqueous 
solution of 13 C -glucose, 20 mL of an aqueous 1 M solution of MgS0 4 , 1 mL of an aqueous 1 M CaCI 2 solution, 5 mL of 
an aqueous solution of thiamin hydrochloride (10mg/mL), 10mLof a solution containing 34 mg/mL of chloramphenicol 
in 100% ethanol and 1 .9 g of ampicillin dissolved in the chloramphenicol solution. The pH of the resulting solution was 

10 adjusted to pH 7.00 by the addition of an aqueous solution of 4N H 2 S0 4 . 

[0096] The preculture of E. Co\\, strain BL21(DE3)/pLysS/pETST-255-1, from the shake-flask scale procedure de- 
scribed above was added to the fermentor contents and cell growth was allowed to proceed until an optical density of 
0.48 was achieved. During this process, the fermenter contents were automatically maintained at pH 7.0 by the addition 
of 4N H 2 S0 4 or 4N KOH as needed. The dissolved oxygen content of the fermenter contents was maintained above 
55% air saturation through a cascaded loop which increased agitation speed when the dissolved oxygen content 
dropped below 55%. Air was fed to the fermenter contents at 7 standard liters per minute (SLPM) and the culture 
temperature was maintained at 37°C throughout the process. 

[0097] The cells were harvested by centrifugation at 17,000 x g for 10 minutes at 4°C and the resulting cell pellets 
were collected and stored at -85°C. The wet cell yield was 3.5 g/L. Analysis of the soluble and insoluble fractions of 
20 cell lysates by sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) revealed that approximately 
50% of the l5 N-stromelysin was found in the soluble phase. 

[0098] The isotopically-labeled stromelysin fragment prepared as described above was purified employing a modi- 
fication of the technique described by Ye et al., Biochemistry, 31: 11231-11235 (1992). 

[0099] The harvested cells were suspended in 20 mM Tris-HCI buffer (pH 8.0) sodium azide solution containing! 
25 mM MgCI 2 , 0.5 mM ZnCI 2 , 25 units/mL of Benzonase® enzyme, and an inhibitor mixture made up of 4-(2-aminoethyl) 
benzenesulfonyl fluoride ("AEBSF'), Leupeptin®, Aprotinin®, and Pepstatin® (all at concentrations of 1 (ig/mL AEBSF, 
Leupeptin®, Aprotinin®, and Pepstatin® are available from American International Chemical, 17 Strathmore Road, 
Natick, MA 01760.) 

[0100] The resulting mixture was gently stirred for one hour and then cooled to 4°C. The ceils were then sonically 
30 disrupted using a 50% duty cycle. The resulting lysate was centrifuged at 14,000 rpm for 30 minutes and the pellet of 
insoluble fraction frozen at -80°C for subsequent processing (see below). 

[0101] Solid ammonium sulfate was added to the supernatant to the point of 20% of saturation and the resulting 
solution loaded onto a 700 mL phenyl sepharose fast flow ("Q-Sepharose FF") column (Pharmacia Biotech., 800 Cen- 
tennial Ave., P. O. Box 1327, Piscataway, NJ 08855). Prior to loading, the sepharose column was equilibrated with 50 
35 mM Tris-HCI buffer (pH 7.6 at 4°C), 5 mM CaCI 2 , and 1 M (NH 4 ) 2 S0 4 . The loaded column was eluted with a linear 
gradient of decreasing concentrations of aqueous (NH 4 ) 2 S0 4 (from 1 down to 0 M) and increasing concentrations of 
aqueous CaCl 2 (from 5 to 20 mM) in Tris-HCI buffer at pH 7.6. 

[0102] The active fractions of eluate were collected and concentrated in an Amicon stirred cell (Amicon, Inc., 72 
Cherry Hill Drive, Beverly, MA 01 91 5). The concentrated sample was dialyzed overnight in the starting buffer used with 
40 the Q-Sepharose FF column, 50 mM Tris-HCI (pH 8.2 at 4°C) with 10 mM CaCI 2 . 

[0103] The dialyzed sample was then loaded on the Q-Sepharose FF column and eluted with a linear gradient com- 
prising the starting buffer and 200 mM NaCI. The purified soluble fraction of the isotopically-labeled stromelysin frag- 
ment was concentrated and stored at 4°C. 

[0104] The pellet was solubilized in 8M guanidine-HCI. The solution was centrifuged for 20 minutes at 20,000 rpm 
45 and the supernatant was added dropwise to a folding buffer comprising 50 mM Tris-HCI (pH 7.6), 10 mM CaCI 2 0.5 
mM ZnCI 2 and the inhibitor cocktail of AEBSR Leupeptin®, Aprotinin®, and Pepstatin® (all at concentrations of 1 u.g/ 
mL). The volume of folding buffer was ten times that of the supernatant. The mixture of supernatant and folding buffer 
was centrifuged at 20,000 rpm for 30 minutes. 

[0105] The supernatant from this centrifugation was stored at 4°C and the pellet was subjected twice to the steps 
so described above of solubilization in guanidine-HCI, refolding in buffer, and centrifugation. The final supematants from 
each of the three centrifugations were combined and solid ammonium sulfate was added to the point of 20% saturation. 
The resulting solution thus derived from the insoluble fraction was subjected to purification on phenyl Sepharose and 
Q-Sepharose as described above for the soluble fraction. 

[0106] The purified soluble and insoluble fractions were combined to produce about 1.8 mg of purified isotopically- 
55 labeled stromelysin 81 -256 fragment per gram of original cell paste. 
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B. Human papillomavirus (HPV) E2 Inhibitors 

[0107] The papillomaviruses are a family of small DNA viruses that cause genital warts and cervical carcinomas. 
The E2 protein of HPV regulates viral transcription and is required for viral replication. Thus, molecules that block the 

s binding of E2 to DNA may be useful therapeutic agents against HPV The protein rather than the DNA was chosen as 
a target, because it is expected that agents wi:h greater selectivity would be found that bind to the protein rather than 
the DNA. > 
[0108] The DNA-binding domain of human papillomavirus E2 was cloned from the full length DNA that codes for E2 
using PCR and overexpressed in bacteria using the T7 expression system. Uniformly 15 N*labeled protein was isolated 

10 from bacteria grown on a minimal medium containing 15 N-labeled protein was isolated from bacteria grown on a minimal 
medium containing 15 N-labeled ammonium chloride. The protein was purified from the bacterial cell lysate using an S- 
sepharose FastFlow column pre-equilibrated with buffer {50 mM Tris, 100 mM NaCI, 1 mM EDTA, pH = 8.3). 
[0109] The protein was eluted with a linear gradient of 100-500 mM NaCI in buffer, pooled, and applied to a Mono- 
S column at a pH = 7.0. The protein was eluted with a salt gradient (100-500 mM), concentrated to 0.3 mM, and 

15 exchanged into a TRIS (50 mM, pH = 7.0 buffered H 2 O/D 2 0 (9/1) solution containing sodium azide (0.5%). 

C. RAF 

[0110] Uniformly 15 N-labeled Ras-binding domain of the RAF protein was prepared as described in Emerson et al., 
20 Biochemistry, 34 (21): 6911-6918 (1995). 

. D. FKBP 

[0111] Uniformly 15 N-labeled recombinant human FK binding protein (FKBP) was prepared as described in Logan 
25 et al., J. Mol. Biol., 236: 637-648 (1 994). 

Example 2 

Screening Compounds Using Two-Dimensional ^ S HPH NMR Correlation Spectral Analysis 

30 

[0112] The catalytic domain of stromelysin was prepared in accordance with the procedures of Example 1. The 
. protein solutions used in the screening assay contained the uniformly 15 N-labeled catalytic domain of stromelysin (0.3 
mM), acetohydroxamic acid (500 mM), CaCI 2 (20 mM), and sodium azide (0.5%) in a H 2 0/D 2 0 (9/1 ) TRIS buffered' 
solutbn (50 mM, pH=7.0). 

35- [0113] Two-dimensional 15 N/ 1 H NMR spectra were generated at 29°C on a Bruker AMX500 NMR spectrometer 
' equipped with a triple resonance probe and Bruker sample changer. The ^N^H HSQC spectra were acquired as 80 
x 1024 complex points using sweep widths of 2000 Hz ( 15 N, t 1 ) and 8333 Hz ( 1 H, t2). A delay of 1 second between 
scans and 8 scans per free induction decay(fid) were employed in the data collection. All NMR spectra were processed 
and analyzed on Silicon Graphics computers using in-house-written software. 

^0 [0114] A first two-dimensional 15 N/ 1 H NMR correlation spectrum was acquired for the 15 N-labeled stromelysin target 
molecule as described above. The stromelysin target was then exposed to a database of test compounds. Stock so- 
lutions of the compounds were made at 1 00 mM and 1 M. In addition, a combination library was prepared that contained 
8-10 compounds per sample at a concentration of 100 mM for each compound. 

[0115] The pH of the I M stock solution was adjusted with acetic acid and ethanolamine so that no pH change was 
4$ observed upon a 1/10 dilution with a 100 mM phosphate buffered solution (pH = 7.0). It is important to adjust the pH, 
because small changes in pH can alter the chemical shifts of the biomolecules and complicate the interpretation of the 
NMR data. 

[01 1 6] The compounds in the database were selected on the basis of size (molecular weight = 1 00-300) and molec- 
ular diversity. The molecules in the collection had different shapes (e.g., flat aromatic rings(s), puckered aliphatic rings 
so (s), straight and branched chain aliphatics with single, double, or triple bonds) and diverse functional groups (e.g., 
carboxylic acids, esters, ethers, amines, aldehydes, ketones, and various heterocyclic rings) for maximizing the pos- 
sibility of discovering compound that interact with widely diverse binding sites. 

[0117] The NMR samples were prepared by adding 4 u.l of the DMSO stock solution of the compound mixtures that 
contained each compound at a concentration of 1 00 mM to 0.4 ml h^O^O (9/1 ) buffered solution of the uniformly 1 5 N- 
55 labeled protein. The final concentration of each of the compounds in the NMR sample was about 1 mM. 

[01 18] In an initial screen, two compounds were found that bind to the catalytic domain of stromelysin. Both of these 
compounds contain a biaryl moiety. Based on these initial hits, structurally similar compounds were tested against 
stromelysin. The structure of those biaryl compounds is represented by the structure I, below. (See Table 1 for definitions 
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of FVR 3 and 



5 




R 2 



io [01 19] In the second round of screening, binding was assayed both in the absence and in the presence of saturating 
amounts of acetohydroxamic acid (500 mM). 

[0120] Many of the biaryl compounds were found to bind the catalytic domain of stromelysin. FIG. 4 shows a repre- 
sentative two-dimensional 15 N/ 1 H NMR correlation spectrum before and after exposure of stromelysin to a biaryl test 
compound. It can be seen from FIG. 4 that the compound caused chemical shifts of 15 N-sites such as those designated 

is W124, T187, A199 and G204. 

[0121] These sites correspond to a tryptophan (Trp) residue at position 124, a threonine (Thr) at position 187, an 
alanine (Ala) at position 1 99, and a glycine (Gly) at position 204 of SEQ ID NO. 1 . FIG. 9 shows the correlation between 
the NMR binding data and a view of the NMR-derived three-dimensional structure of the catalytic domain of stromelysin. 
The ability to locate the specific binding site of a particular ligand is an advantage of the present invention. 

20 [01 22] Some compounds only bound to stromelysin in the presence of hydroxamic acid. Thus, the binding affinity of 
some compounds was enhanced in the presence of the hydroxamic acid (i. e. cooperative). These results exemplify 
another important capability of the present screening assay: the ability to identify compounds that bind to the protein 
in the presence of other molecules. 

[0123] Various biaryl compounds of structure I were tested for binding to stromelysin at differing concentrations. 
25 The 1 5 N/ 1 H spectra generated at each concentration were evaluated to quantify differences in the spectra as a function 
of compound concentration. A binding or dissociation constant (KD)was calculated, using standard procedures well 
known in the art, from those differences. The results of this study are shown in Table 1. The values for R1-R3 and 
A1 -A3 in Table 1 refer to the corresponding positions in the structure 1 , above. 

30 Table 1 
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so 

[01 24] The data in Table 1 show the utility of a process of the present invention in determining dissociation or binding 
constants between a ligand and a target molecule. 

[0125] Another advantage of an NMR screening assay of the present invention is the ability to correlate observed 
chemical shifts from the two-dimensional 15 N/ 1 H NMR correlation spectra with other spectra or projections of target 
55 molecule configuration. The results of a representative such correlation are shown in FIG. 9, which depicts regions 
within the polypeptide at which binding with the substrate molecule is most likely occurring. In this Figure, the apparent 
binding regions in stromelysin are shown for Compound 1 (from Table 1 ). 

[0126] Compounds from the database were screened in a similar manner for binding to the DNA-binding domain of 
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the E2 protein. Those compounds had the structure li below, where fVF^ and A are defined in Table 2. 



5 




JO 

[0127] NMR experiments were performed at 29°C on a Bruker AMXSOO NMR spectrometer equipped with a triple 
resonance probe and Bruker sample changer. The 15 N-/ 1 H HSQC spectra were acquired as 80 x 1 024 complex points 
using sweep widths of 2000 Hz (^N,^) and 8333 Hz ( 1 H, t 2 ). A delay of 1 second between scans and 4 scans per free 
is induction decay were employed in the data collection. All NMR spectra were processed and analyzed on Silicon Graph- 
ics computers. 

[0128] FIGS. 2 and 3 show representative two-dimensional 15 N/ 1 H NMR correlation spectra before and after expo- 
sure of the DNA-binding domain of E2 to a first and second test compound, respectively. 

[0129] It can be seen from FIG. 2 that the first test compound caused chemical shifts of 15 N-sites such as those 
20 designated 115, Y21, R22 and L23. Those sites correspond to an tsoleucine (Me) residue at position 15, a tyrosine 
residue (Tyr) at position 21 , an arginine (Arg residue at position 22 and a leucine (Leu) residue at position 23 of SEQ 
ID NO. 6. 

[0130] It can be seen from FIG. 3 that the second test compound caused chemical shifts in the particular 15 N-sites 
designated 16, G11, H38, and T52. Those sites correspond to an isoleucine (lie) residue at position 6, a glycine (Gly) 
25 residue at position 11 , a histidine (His) residue at position 38 and a threonine (Thr) at position 52 of SEQ ID NO. 6. 
[0131] FIGs. 7 and 8 show the correlation between those NMR binding data and a view of the NMR-derived three- 
dimensional structure of the DNA-binding domain of E2. 

[0132] Several structurally similar compounds caused chemical shift changes of the protein signals when screened 
at a concentration of 1 mM. Two distinct sets of amide resonances were found to change upon the addition of the 
30 compounds: one set of signals corresponding to amides located in the ^-barrel formed between the two monomers 
and a second set corresponding to amides located near the DNA-binding site. 

[0133] For example, compounds containing two phenyl rings with a carboxylic acid attached to the carbon linking 
the two rings only caused chemical shift changes to the amides in the DNA-binding site. In contrast, benzophenones 
and phenoxyphenyl-containing compounds only bound to the p-barrel. Other compounds caused chemical shift chang- 
es es of both sets of signals but shifted the signals in each set by different amounts, suggesting the presence of two 
distinct binding sites. 

[01 34] By monitoring the chemical shift changes as a function of ligand concentration, binding constants for the two 
binding sites were also measured. The results of those studies are summarized below in Table 2. 

40 Table2 
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a a dash (-) for A indicates no atom (i.e. byphenyl linkage) 
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[0135] Uniformly 15 N-labeted Ras-binding domain of the RAF protein was prepared as described in Example 1 and 
screened using two-dimensional 15 N/ 1 H NMR correlation spectral analysis in accordance with the NMR procedures 
described above. The results of a representative study are shown in FIG. 5, which depicts two<iimensional 15 N/ 1 H 
NMR correlation spectra both before and after exposure to a test compound. 

[0136] Uniformly 15 N-labeled FKBP was prepared as described in Example 1 and screened using two- 
dimensional 15 N/ 1 H NMR correlation spectral analysis in accordance with the NMR procedures described above. The 
results of a representative study are shown in FIG. 6, which depicts two-dimensional ^N^H NMR correlation spectra 
both before and after exposure to a test compound. 

Example 3 

Comparison of NMR Enzymatic, Filter Binding and Gel Shift Screening Assays 

[01 37] Studies were performed to compare binding constants of ligands to various biomolecules, determined by the 
NMR method of the present invention, to similar results obtained from prior art methods. 

[0138] In a first study, binding constants were determined, both by the NMR method of the present invention, and 
by a prior art enzymatic assay. The target molecule was the catalytic domain of stromelysin prepared in accordance 
with the procedures of Example 1. The NMR binding constants, KD, were derived using two-dimensional ^N^H NMR 
correlation spectroscopy as described in Example 2. The K D values so obtained were compared to an inhibition constant 
K, as determined in an enzymatic assay. 

[01 39] The enzymatic assay measured the rate of cleavage of a fluorogenic substrate by following the fluorescence 
increase upon peptide cleavage which causes a separation between the fluorophore and quencher. Enzymatic activity 
was measured using a matrix of different concentrations of acetohydroxamic acid and biaryl compounds. The assay 
is a modification of the method described by H. Weingarten, era/, in Anal. Biochem., 147: 437-440 (1985) employing 
the fluorogenic substrate properties described by E. Matayoshi, etai in Science : 247: 954-958 (1990). 
[01 40] Eight acetohydroxamic acid concentrations were used ranging from 0.0 to 1 .0 M, and six compound concen- 
trations were used, resulting in a total of 48 points. Individual compound concentration varied due to solubility and 
potency. 

[0141] All NMR measurements were performed in the presence of 500 mM acetohydroxamic acid, except for the 
titration of acetohydroxamic acid itself. Dissociation constants were obtained from the dependence of the observed 
chemical shift changes upon added ligand. Inhibition constants were then obtained from the inhibition data using stand- 
ard procedures. 

[0142] The results of these studies are summarized below in Table 3, which shows the comparison of NMR-derived 
dissociation constants (Kq) with inhibition constants measured in the enzyme assay (K t ) using a fluorogenic substrate. 



Table 3 



Compound No. 


NMR K D (mM) 


Assay K t (mM) 


4 


1.6 


7.4 


7 


0.17 


0.32 


9 


0.16 


0.70 


10 


0.40 


1.8 


12 


0.02 


0.11 


Acetohydroxamic acid 


17.0 


21.1 



[01 43] The data in Table 3 show that a NMR process of the present invention provides a rapid, efficient and accurate 
way of determining dissociation or binding constants of ligands to target biomolecules. Comparison of the binding 
constants determined by the two methods result in the same ranking of potencies of the compounds tested. That is, 
while the values for a given substrate as determined by the two methods are not equal, they are proportional to one 

another. 

[0144] In a second study, the results for binding of the DNA-binding domain of E2 to its target DNA were obtained 
by prior art methods and compared with results obtained by the method of the present invention. The target was the 
-DNA-binding domain of E2 t prepared in accordance with the procedures of Example 1 . NMR screening assays and 
NMR processes for determining ligand dissociation constants were performed as set forth above in Example 2. 
[0145] The binding constant from the NMR process was compared to the results of a physical, filter binding assay 
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that measured binding of DNA to the target. The high-throughput filter binding assay was performed using E2, prepared 
according to Example 2 above. The 33 P-labeled DNA construct comprised a 10,329 base pair plasmid formed by in- 
serting the HPV-1 1 genome, containing three high affinity and one low affinity E2 binding sites, into the PSP-65 ptasmid . 
(Promega, Madison, Wl). 

5 [0146] The binding affinities at the different sites as determined by NMR were compared for a subset of the com- 
pounds to the inhibition of E2 binding to DNA as measured in the filter binding assay As shown in Table 2 above, the 
activities determined in the filter binding assay correlated closely with the binding affinities calculated from the amides 
of the DNA-binding site but not to the affinities measured for the p-barrel site. This is consistent with the relative locations 
of each site. 

10 [0147] In an alternative study, a comparison of the NMR-determined binding results was made with similar results 
obtained by a prior art gel-shift assay using techniques well known in the art. The gel-shift assay was performed using 
a GST fusion protein which contained full length E2 and a 33 P-labeled 62 base pair DNA fragment containing two E2 
binding sites. 

[01 48] The method identified numerous compounds which gave positive results in the gel-shift assay. Some of these 
is positive results, however, were believed to be due to binding to the DNA, since in these cases, no binding to the E2 
protein was observed using the NMR method of this invention. These compounds were shown to indeed bind to DNA 
rather than to E2, as evidenced by changes in the chemical shifts of the DNA rather than the protein upon the addition 
of the compounds. These data show that yet another advantage of the present invention is the ability to minimize the 
occurrence of false positives. 

20 

Example 4 

Use of the screening process of the present invention in the design of a potent, non-peptide inhibitor of stromelysin 

25 [0149] Studies were performed to design new ligands that bound to the catalytic domain of stromelysin. Because 
stromelysin undergoes autolysis, an inhibitor was sought to block the degradation of stromelysin. That inhibitor would 
facilitate the screening of other potential ligands that bind to other sites on the enzyme. 

[0150] The criteria used in selecting compounds in the screening for other binding sites was based primarily on the 
size of the ligand. The smallest ligand was sought that had enough solubility to saturate (>98% occupancy of enzyme) 

30 and inhibit the enzyme. 

[0151] The cloning, expression, and purification of the catalytic domain of stromelysin was accomplished using the 
procedures set forth in Example 1 . An initial step in the design of the new ligand was the identification of a first ligand 
that bound to the stromelysin target. Such identification was carried out in accordance with a two-dimensional ^N^H 
NMR correlation screening process as disclosed above. 

35 [01 52] A variety of hydroxarnic acids of the general formula R-(CO)NHOH were screened for binding to stromelysin 
using the procedures set forth in Example 2. Of the compounds tested, acetohydroxamic acid [CH3(CO)NHOH] best 
satisfied the selection criteria: it had a binding affinity for stromelysin of 17 mM and had good water solubility. At a 
concentration of 500 mM, acetohydroxamic acid inhibited the degradation of the enzyme, allowing the screening of 
other potential ligands. 

40 [0153] The second step in the design process was the identification of a second ligand that bound to the target 
stromelysin at a site different from the binding site of acetohydroxamic acid. This was accomplished by screening 
compounds for their ability to bind stromelysin in the presence of saturating amounts of acetohydroxamic acid. Details 
of procedures and results of this second identification step are set forth above in Example 2. 
[0154] The compound identified as a second ligand from these studies and used in subsequent design steps was 

45 the compound designated as Compound #4 in Table 1 (See Example 2). 

[0155] The next step in the design process was to construct a ternary complex of the target stromelysin, the first 
ligand and the second ligand. This was accomplished by exposing the stromelysin target to the two ligands under 
conditions that resulted in complex formation. The three-dimensional structure of the ternary complex was then deter- 
mined using NMR spectroscopy as described below. 

so [0156] The 1 H, 13 C, and 15 N backbone resonances of stromelysin in the ternary complex were assigned from an 
analysis of several 3D double- and triple-resonance NMR spectra (A. Bax era/.. Acc. Chem. Res., 26: 131-138(1993)). 
The C a resonances of adjacent spin systems were identified from an analysis of three-dimensional (3D) HNCA (L. Kay 
etal, J. Maqn. Reson., 89: 496-514 (1990)) and HN(CO)CA (A. Bax, era/., J. Bio. NMR, 1:99 (1991)) spectra recorded 
with identical spectral widths of 1773 Hz (35.0 ppm), 3788 Hz (30.1 ppm), and 8333 Hz (16.67 ppm) in the F^N), 

55 F 2 ( 13 C) and F 3 ( 1 H) dimensions, respectively. 

[0157] The data matrix was 380,) x 48(t 2 ) x 1024(t 3 ) complex points for the HNCA spectrum, and 32(t,) x 40(t 2 ) x 
1024(t 3 ) complex points for the HN(CO)CA spectrum. Both spectra were acquired with 16 scans per increment. A 3D 
CBCA(CO)NH spectrum (S. Grzesiek, etal.. J. Am. Chem. Soc., 114: 6261-6293 (1992)) was collected with 32(t 1t 15 N) 
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x 48(t 2 , 13 C) x 1024(t 3 , *H) complex points and 32 scans per increment. Spectral widths were 1773 Hz (35.0 ppm), 
7575.8 Hz (60.2 ppm), and 8333 Hz (16.67 ppm) in the l5 N, 13 C and dimensions, respectively. 
[0158] For all three spectra, the 1 H carrier frequency was set on the water resonance and the 15 N carrier frequency 
was at 119.1 ppm. The 13 C carrier frequency was set to 55.0 ppm in HNCA and HN(CO)CA experiments, and 46.0 

5 ppm in the CBCA(CO)NH experiment. 

[01 59] The backbone assignments were confirmed from an analysis of the crosspeaks observed in an 15 N-separated 
3D NOESY-HSQC spectrum and a 3D HNHA-J spectrum. The 15 N-separated 3D NOESY-HSQC spectrum (S. Fesik, 
etaL J. Maqn.Reson., 87: 588-593 (1988)); D. Marion, era/., J. Am. Chem. Soc., 111: 1515-1517(1989)) was collected 
with a mixing time of 80 ms. A total of 68(t 1f 15 N)x 96(t 2 , 1 H)x 1024(t 3 , 1 H) complex points with 16 scans per increment 

w were collected, and the spectral widths were 1773 Hz (35.0 ppm) for the 15 N dimension, 6666.6 Hz (t 2 , 1 H, 13.3 ppm), 
and 8333 Hz (16.7 ppm) for the 1 H dimension. 

[0160] The 3D HNHA-J spectrum (G. Vuister, et aL, J. Am. Chem. Soc., 115: 7772-7777 (1993)), which was also 
used to obtain 3 J H NHa coupling constants, was acquired with 35(t 1( 15 N) x 64(t 2 , 1 H) x 1024(t 3 , *H) complex points and 
32 scans per increment. Spectral widths and carrier frequencies were identical to those of the 15 N-separated NOESY- 
15 HSQC spectrum. Several of the HP signals were assigned using the HNHB experiment. The sweep widths were the 
same as in the 15 N-separated NOESY-HSQC spectrum that was acquired with 32(t 1t i$N) x 96(t 2 , 1 H) x 1024 (t 3 , 1 H) 
complex points. 

[0161] The 1 H and 1 3 C chemical shifts were assigned for nearly all sidechain resonances. A 3D HCCH-TOCSY spec- 
trum (L. Kay, era/., J. Maqn. Reson., 101b: 333-337 (1993)) was acquired with a mixing time of 1 3 ms using the DIPSI- 
20 2 sequence (S. Rucker, era/., Mol. Phys. , 68:509(1989)) for 13 C isotropic mixing. A total of 96 (t v 13 C)x96(t 2 , 1 H) x 
1 024(t 3 , 1 H) complex data points were collected with 1 6 scans per increment using a spectral width of 1 0638 Hz (70.8 
ppm, Wt), 4000 Hz (6.67 ppm, w 2 ), and 4844 (8.07 ppm, w 3 ). Carrier positions were 40 ppm, 2.5 ppm, and at the water 
frequency for the 13 C, indirectly detected 1 H, and observed 1 H dimensions, respectively. 

[0162] Another 3D HCCH-TOCSY study was performed with the 13 C carrier at 122.5 ppm to assign the aromatic 
25 residues. The spectra were collected with 36(t v 13 C) x 48(t 2 , 1 H) x 1024 (t 3 , 1 H) complex points with spectral widths of 
5263 Hz (35.0 ppm, 3180 Hz (5.30 ppm, w 2 ), and 10,000 (16.7 ppm, w 3 ). Carrier positions were 122.5 ppm, 7.5 
ppm, and at the water frequency for the 13 C, indirectly detected 1 H, and observed 1 H dimensions, respectively. 
[0163] A 13 C-separated 3D NOESY-HMQC spectrum (S. Fesik, et al., J. Magn . Reson. , 87: 588-593 (1988)); D. 
Marion, et aL, J. Am. Chem. Soc., 111: 1515-1517 (1989)) was recorded using a mixing time of 75 ms. A total of 80 
30 (t 1f 13 C) x 72 (t 2 , 1 H) x 1024 (t 3 , 1 H) complex data points with 16 scans per increment were collected over spectral 
widths of 10638 Hz (70.49 ppm, w^, 6666.6 Hz (13.3 ppm, w 2 ), and 8333.3 Hz (16.67 ppm, w 3 ). The *H carrier fre- 
quencies were set to the water resonance, and the 13 C carrier frequency was placed at 40.0 ppm. 
[0164] Stereospecific assignments of methyl groups of the valine and leucine residues were obtained by using a 
biosynthetic approach (Neri et al, Biochem., 28: 7510-7516 (1989)) on the basis of the 13 C- 13 C one-bond coupling 
os pattern observed in a high-resolution 1 H, 13 C-HSQC spectrum (G. Bodenhausen, et aL, J. Chem. Phys. Lett., 69: 
■ 185-189 (1980)) of a fractionally 13 C -labeled protein sample. The spectrum was acquired with 200 ( 13 C, t^ x2048( 1 H, 
t 2 ) complex points over spectral widths of 5000 Hz (39.8 ppm, 13 C) and 8333 Hz (16.7 ppm, 1 H). Carrier positions were 
20.0 ppm for the 13 C dimension, and at the water frequency for the 1 H dimension. 

[0165] To detect NOES between the two ligands and the protein, a 3D 12 C-filtered, 13 C-edited NOESY spectrum was 

40 collected. The pulse scheme consisted of a double 13 C-filter sequence (A. Gemmeker, et al, J. Magn. Reson., 96: 
1 99-204 (1 992)) concatenated with a NOESY-HMQC sequence (S. Fesik, etal, J. Magn. Reson. , 87: 588-593 (1988)); 
D. Marion, etal.. J. Am. Chem. Soc. , 111: 1515-1517 (1989)) . The spectrum was recorded with a mixing time of 80 
ms, and a total of 80 (tl, 13 C) x 80 (t^ 1 H) x 1024 (t 3 , 1 H) complex points with 16 scans per increment. Spectral widths 
were 8865 Hz (17.73 ppm, Wt), 6667 Hz (1 3.33 ppm, w 2 ), and 8333 Hz (1 6.67 ppm, w 3 ), and the carrier positions were 

45 40.0 ppm for the carbon dimension and at the water frequency for both proton dimensions. 

[0166] To identify amide groups that exchanged slowly with the solvent, a series of 1 H, 15 N-HSQC spectra (G. Bod- 
enhausen, et aL, J. Chem. Phys. Lett., 69: 185-189 (1980)) were recorded at 25°C at 2 hr intervals after the protein 
was exchanged into D 2 0. The acquisition of the first HSQC spectrum was started 2 hrs. after the addition of D 2 0. 
[0167] All NMR spectra were recorded at 25°C on a Bruker AMX500 or AMX600 NMR spectrometer. The NMR data 

50 were processed and analyzed on Silicon Graphics computers. In all NMR experiments, pulsed field gradients were 
applied where appropriate as described (A. Bax, et aL, J. Maqn. Reson., 99: 638 (1992)) to afford the suppression of 
the solvent signal and spectral artifacts. Quadrature detection in indirectly detected dimensions was accomplished by 
using the States-TPPI method (D. Marion, ef at., J. Am. Chem. Soc., 111: 1515-1517 (1989)). Linear prediction was 
employed as described (E. Olejniczak, et aL, J. Maqn. Reson. , 87: 628-632 (1 990)). 

55 [01 68] The derived three-dimensional structure of the ternary complex was then used to define the spatial orientation 
of the first and second ligands to each other as well as to the target stromelysin molecule. 

[0169] Distance restraints derived from the NOE data were classified into six categories based on the NOE cross 
peak intensity and given a lower bound of 1 .8 A and upper bounds of 2.5 A, 3.0 A, 3.5 A, 4.0 A, 4.5 A, and 5.0 A, 
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respectively. Restraints for torsional angles were derived from 3 J HNHa coupling constants measured from the 3D 
HNHA-J spectrum (G. Vuister, et at., J. Am. Chem. See., 115: 7772-7777 (1993)). The 0 angle was restrained to 
120%±40% for 3 J HN Ha > 8-5 Hz, and 60%±40% for 3 J H nh<x < 5 Hz. 

[0170] Hydrogen bonds, identified for slowly exchanging amides based on initial structures, were defined by two 
5 restraints: 1.8-2.5 A for the H-O distance and 1.8-3.3 A for the N-O distance. Structures were calculated with the X- 
PLOR 3.1 program (A. Brunger, 'XPLOR 3.1 Manual." Yale University Press, New Haven, 1992) on Silicon Graphics 
computers using a hybrid distance geometry-simulated annealing approach (M. Nilges, et al. , FEBS Lett. , 229: 31 7-324 
(1988)). 

[0171] A total of 1032 approximate interproton distance restraints were derived from the NOE data. In addition, 21 
10 unambiguous intermolecular distance restraints were derived from a 3D 12C-filtered, 13C-edited NOESY spectrum. 
Of the 1032 NOE restraints involving the protein, 341 were intra-residue, 410 were sequential or short-range between 
residues separated in the primary sequence by less than five amino acids, and 281 were long-range involving residues 
separated by at least five residues. 

[0172] In addition to the NOE distance restraints, 14 $ dihedral angle restraints were included in the structure cat- 
's culations that were derived from three-bond coupling constants ( 3 JhnhJ determined from an HNHA-J spectrum (G. 
Viioster, et al.. J. Am. Chem. Soc. , 115: 7772-7777 (1993)). The experimental restraints also included 120 distance 
restraints corresponding to 60 hydrogen bonds. The amides involved in hydrogen bonds were identified based on their 
characteristically slow exchange rate, and the hydrogen bond partners from initial NMR structures calculated without 
the hydrogen bond restraints. The total number of non-redundant, experimentally-derived restraints was 1166. 
20 [0173] The structures were in excellent agreement with the NMR experimental restraints. There were no distance 
violations greater than 0.4 A, and no dihedral angle violations greater than 5 degrees. In addition, the simulated energy 
for the van der Waals repulsion term was small, indicating that the structures were devoid of bad inter-atomic contacts. 
[0174] The NMR structures also exhibited good covalent bond geometry, as indicated by small bond-length and 
bond-angle deviations from the corresponding idealized parameters. The average atomic root mean square deviation 
25 of the 8 structures for residues 93-247 from the mean coordinates was 0.93 A for backbone atoms (C a , N, and C), and 
1.43 A for all non-hydrogen atoms. 

[0175] A ribbon plot of the ternary complex involving stromelysin, acetohydroxamic acid (the first ligand), and the 
second ligand is shown in Fig 10. The structure is very similar to the global fold of other matrix metalloproteinases and 
consists of a five-stranded p-sheet and three a-helices. 
30 [01 76] The catalytic zinc was located in the binding pocket. It was coordinated to three histidines and the two oxygen 
atom of acetohydroxamic acid. A biaryl group of the second ligand was located in the SV pocket between the second 
helix and the loop formed from residues 218-223. This deep and narrow pocket is lined with hydrophobic residues 
which make favorable contacts with the ligand. 

[0177] Based on the three-dimensional structure of the ternary complex as determined above and the structure/ 
55 activity relationships observed for the binding to stromelysin of structural analogs of the second ligand (i.e. , other biaryl 
compounds), new molecules were designed that linked together the acetohydroxamic acid to biaryls. 
[0178] As shown in Table 4 below, the initial biaryls chosen contained an oxygen linker and the absence or presence 
of CN para to the biaryl linkage. Initial linkers contained varying lengths of methylene units. Means for linking compounds 
with linkers having varying lengths of methylene units are well known in the art. 

40 



Table 4 
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Compound 


X 


R 


Stromelysin 
Inhibition 


21 


' (CH 2 ) 2 


H 


0.31 uM 


22 


(CH 2 ), 


H 


HOuM 


23 


(CH 2 )4 


H 


38%@100nM 


24 


(CH 2 )5 


H 


43%@100jiM 


25 


(CH 2 ) 2 


CN 


0.025 \iM 


26 


(CH 2 )i 


CN 


3.4 \xM 


27 


(CH 2 ) 4 


CN 


3.5 |iM 


28 


(CH 2 )5 


CN 


1.7 ^iM 



20 [0179] As expected based on the better binding of the CN substituted biaryls to stromelysin, the CN derivatives 
exhibited better stromelysin inhibition. The compound that exhibited the best inhibition of stromelysin contained a linker 
with two methylene units. 

[01 80] The present invention has been described with reference to preferred embodiments. Those embodiments are 
not limiting of the claims and specification in any way. 

25 

SEQUENCE LISTING 
(1) GENERAL INFORMATION: 
30 [0181] 

(i) APPLICANT: 

Fesik, Stephen W. 
35 Hajduk, Philip J. 

(ii) TITLE OF INVENTION: Use of Nuclear Magnetic Resonance to Identify Ligands to Target Biomolecules 

(iii) NUMBER OF SEQUENCES: 6 

40 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Steven F. Weinstock, Dept. 377 AP6D, Abbott Laboratories 

(B) STREET: 100 Abbott Park Road 
45 (C) CITY: Abbott Park 

(D) STATE: Illinois 
(B) COUNTRY: USA 
(F) ZIP: 60064-3500 

so (v) COMPUTER READABLE FORM: . 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

55 (D) SOFTWARE: Patentln Release #1 .0, Version #1 .30 

(vi) CURRENT APPLICATION DATA: 
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(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

5 (vtii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Janssen, Jerry F. 

(B) REGISTRATION NUMBER: 29 t 175 

10 (ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (708) 937-4558 

(B) TELEFAX: (708) 938-7742 

is (2) INFORMATION FOR SEQ ID NO:1: 
[0182] 

(i) SEQUENCE CHARACTERISTICS: 

20 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

30 

Phe Arg Thr Phe Pro Gly lie Pro Lys Trp Arg Lys Thr His Leu Thr 
1 5 10 15 

Tyr Arg lie Val Asn Tyr Thr Pro Asp Leu Pro Lys Asp Ala Val Asp 
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10 



15 



20 



20 25 30 

Ser Ala Val Glu Lys Aia Leu Lys Val Trp Glu Glu Val Thr Pro Leu 
35 40 45 

Thr Phe Ser Arg Leu Tyr Glu Gly Glu Ala Asp He Met He Ser Phe 
50 55 60 

Ala Val Arg Glu His Gly Asp Phe Tyr Pro Phe Asp Gly Pro Gly Asn 
65 70 75 80 

Val Leu Ala His Ala Tyr Ala Pro Gly Pro Gly He Asn Gly Asp Ala 
85 90 95 

His Phe Asp Asp Asp Glu Gin Trp Thr Lys Asp Thr Thr Gly Thr Asn 
100 105 HO 

Leu Phe Leu Val Ala Ala His Glu He Gly His Ser Leu Gly Leu Phe 
115 120 125 

His Ser Ala Asn Thr Glu Ala Leu Met Tyr Pro Leu Tyr His Ser Leu 
130 135 140 

Thr Asp Leu Thr Arg Phe Arg Leu Ser Gin Asp Asp He Asn Gly He 
2S 145 150 155 160 

Gin Ser Leu Tyr Gly Pro Pro Pro Asp Ser Pro Glu Thr Pro 
165 . 170 

30 (2) INFORMATION FOR SEQ ID NO:2: 
[0183] 

(i) SEQUENCE CHARACTERISTICS: 

35 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single - 

(D) TOPOLOGY: linear 

40 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 



45 



SO 
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Met Ala Thr Thr Pro lie lie His Leu Lys Gly Asp Ala Asn lie Leu 

15 10 15 

Leu Cys Leu Arg Tyr Arg Leu Ser Lys Tyr Ly3 Gin Leu Tyr Glu Gin 
20 25 30 



10 



Ser Val Ser Thr Trp His Trp Thr Cys Thr Asp Gly Lys His Lys Asn 
35 40 45 

Ala lie Val Thr Leu Thr Tyr He Ser Thr Ser Gin Ara Asp Asp Phe 
50 55 60 



15 



Leu Asn Thr Val Lys He Pro Asn Thr Val Ser Val Ser Thr Gly Tyr 
65 70 75 80 

Met Thr He 



(2) INFORMATION FOR SEQ ID NO:3: 

20 

[0184] 

(i) SEQUENCE CHARACTERISTICS: 

25 (A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (jj) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 



GAAATGAAGA GTCTTCAA 



(2) INFORMATION FOR SEQ ID NO:4: 
[0185] 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

so 

GCGTCCCAGG TTCTCGAG 
18 

55 
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(2) INFORMATION FOR SEQ ID NO:5: 
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[0186] 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 



ATACCATGGC CTATCCATTG GATGGAGC 
28 



(2) INFORMATION FOR SEQ ID NO:6: 
[0187] 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 



ATAGGATCCT TAGGTCTCAG GGGAGTCAGG 
30 



Claims 

1. A process of screening compounds to identity compounds that are ligands that bind to a specific target molecule 
comprising the steps of: 

a) generating a first two-dimensional 15 N/ 1 H NMR correlation spectrum of a 15 N-labeled target molecule; 

b) exposing the labeled target molecule to one or a mixture of chemical compounds; 

c) generating a second two-dimensional 15 N/ 1 H NMR correlation spectrum of the labeled target molecule that 
has been exposed to one 

or a mixture of compounds in step (b); and 

d) comparing said first and second two-dimensional 15 N/ 1 H NMR correlation spectra to determine differences 
between said first and said second spectra, the differences identifying the presence of one or more compounds 
that are ligands which have bound to the target molecule. 

2. The process of claim 1 wherein the 15 N-labeled target molecule is exposed to a mixture of chemical compounds 
in step (b), further comprising the steps subsequent to step d) of 
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e) exposing the 15 N-labeled target molecule individually to each compound of said mixture, 

f ) generating a two-dimensional 1 5 N/ 1 H NMR correlation spectrum of the labeled target molecule that has been 
individually exposed to each compound; and 

g) comparing each spectrum generated in step f) to said first spectrum to determine differences in any of those 
s compared spectra, the differences identifying the presence of a compound that is a ligand which has bound 

to the target molecule. n 

3. The process of claim 1 wherein the differences in the two-dimensional 1 5 N/ 1 H NMR correlation spectra are chemical 
shifts at particular 15 N-labeled sites in the target molecule and chemical shifts in protons attached to those 15 N- 

10 labeled sites. 

4. The process of claim 1 wherein the target molecule is a polypeptide. 

5. A process of determining the dissociation constant between a target molecule and a ligand that binds to that target 
is ' molecule comprising the steps of: 

a) generating a first two-dimensional 15 N/ 1 H NMR correlation spectrum of a 15 N-labeled target molecule; 

b) exposing the labeled target molecule to various concentrations of a ligand; 

c) generating a two-dimensional 15 N/ 1 H NMR correlation spectrum at each concentration of ligand from step 
20 (b): 

d) comparing each spectrum from step (c) both to the first spectrum from step (a) and to all other spectra from 
step (c) to quantify differences in those spectra as a function of changes in ligand concentration; and 

e) calculating the dissociation constant between the target molecule and the ligand from those differences 
according to the equation: 

25 

„ _([P] 0 - x )([iV x 



30 where [P] 0 is the total molar concentration of target 

molecule; 

[L] 0 is the total molar concentration of ligand; and 
x is the molar concentration of the bound species determined 
35 according to the equation: 



40 



^ __ obs * free 



where 



&obs and s free are tne chemical shift values for the target molecule determined at each concentration of 
ligand and for the target molecule in the absence of ligand, respectively, and A is the difference between 
45 the chemical shift at saturating amounts of ligand and 5 free . 

6. The process of claim 5 wherein the target molecule is a polypeptide. 

7. The process of claim 5 further comprising the step of binding the labeled target molecule to a second ligand before 
so step (a). 

Patentanspruche 

55 1 . Verfahren zum Screening von Verbindungen zur Identifikation von Verbindungen, die an spezifische Zielmolekule 
bindende Liganden sind, wobei das Verfahren folgende Schritte umfaflt: 

a) Erzeugen eines ersten zweidimensionalen ^N^H NMR Korrelationsspektrums eines 15 N-markierten Ziel- 
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molekuls; 

b) Aussetzen des markierten Zielmolekuls mit einer chemischen Verbindung oder einer Mischung von chemi- 
schen Verbindungen; 

c) Erzeugen eines zweiten zweidimensionalen 15 N/ 1 H NMR Korrelationsspektrums des markierten Zielmole- 
kuls, das in Schritt b) einer oder einer Mischung von Verbindungen ausgesetzt worden ist; und 

d) Vergleich der ersten und zweiten zweidimensionalrsn 15 N/ 1 H NMR Korrelationsspektren, um Unterschiede 
zwischen den ersten und zweiten Spektren zu ermitteln, wobei diese Unterschiede die Anwesenheit einer 
oder mehrerer Verbindungen anzeigen, die Liganden sind, die an das Zielmolekul gebunden haben. 

Vertahren nach Anspruch 1, worin das 15 N-markierte Zielmolekdl in Schritt (b) einer Mischung von chemischen 
Verbindungen ausgesetzt wird, wobei das Verlahren noch folgende, nach Schritt d) erfolgende Schritte umfaGt: 

e) einzelnes Aussetzen des 15 N-markierten Zielmolekuls mit jeder Verbindung der Mischung, 

f) Erzeugen eines zweidimensionalen ^N^H NMR Korrelationsspektrums des markierten Zielmolekuls, das 
jeder Verbindung einzeln ausgesetzt worden ist; und 

g) Vergleich eines jeden in Schritt f) erzeugten Spektrums mit dem ersten Spektrum, um Unterschiede in 
irgendeinem der verglichenen Spektren zu ermitteln, wobei die Unterschiede die Anwesenheit einer Verbin- 
dung identifizieren, die ein Ligand ist, der an das Zielmolekul gebunden hat. 

Vertahren nach Anspruch 1, worin die Unterschiede in den zweidimensionalen 15 N/ 1 H NMR Korrelationsspektren 
chemische Verschiebungen an besonderen, 15 N -markierten Stellen des ZielmolekOls und chemische Verschie- 
bungen von Protonen sind, die an diese 15 N-markierten Stellen gebunden sind. 

Vertahren nach Anspruch 1, worin das Zielmolekul ein Polypeptid ist. 

Vertahren zur Bestimmung der Dissoziationskonstanten zwischen einem Zielmolekul und einem Liganden, der an 
das Zielmolekul bindet, wobei das Vertahren folgende Schritte umfafct: 

a) Erzeugen eines ersten zweidimensionalen ^N^H NMR Korrelationsspektrums eines 15 N-markierten Ziel- 
molekuls; 

b) Aussetzen des markierten Zielmolekuls mit verschiedenen Konzentrationen eines Liganden; 

c) Erzeugen eines zweidimensionalen ^N^H NMR Korrelationsspektrums bei jeder Ligandenkonzentration 
nach Schritt b); und 

d) Vergleich jeden Spektrums aus Schritt c) sowohl mit dem ersten Spektrum aus Schritt a) als auch mit alien 
anderen Spektren aus Schritt c), um die Unterschiede in diesen Spektren quantitativ als Funktion der sich 
verandernden Ligandenkonzentration zu bestimmen; und 

e) Berechnen der Dissoziationskonstanten zwischen dem Zielmolekul und dem Liganden, auf der Grundlage 
dieser Unterschiede und gemaQ der folgenden Gleichung: 

„ ([P] 0 -x)([L] o -x) 



worin 

[P] 0 die gesamte molare Konzentration des Zielmolekuls ist; 
[L] 0 die gesamte molare Konzentration des Liganden ist; und 

x die molare Konzentration der gebundenen Spezies ist, wie sie gemaG der folgenden Gleichung ermittelt 
wurde: 



- ^obs " free 
A 

worin 5 0bs und 5 free chemischen Verschiebungswerte fur das Zielmolekul sind, die jeweils bei jeder Li- 
gandenkonzentration und fur das Zielmolekul in Ligandenabwesenheit ermittelt wurden, und worin A der Un- 
terschied in der chemischen Verschiebung zwischen der Ligandensattigungsmenge und 5f ree ist. 
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6. Verfahren nach Anspruch 5, worin das Zielmotekul ein Polypeptid ist. 

7. Verfahren nach Anspruch 5, das des weiteren den Schritt der Bindung des markierten Zielmolekuls an einen zwei- 
ten Liganden vor Schritt a) umfaGt. 

5 

Revendications 

1. Procede de criblage de composes pour identifier des composes qui sont des ligands qui se lient a une molecule 
10 cible specifique comprenant les etapes de : 

a) production d'un premier spectre de correlation RMN 15 N/ 1 H bidimensionnelie d'une molecule cible marquee 
avec 15 N ; 

b) exposition de la molecule cible marquee a un compost chimique ou a un melange de composes chimiques ; 
is c) production d'un second spectre de correlation RMN 15 N/ 1 H bidimensionelle de la molecule cible marquee 

qui a ete exposee a un compose ou a un melange de composes dans Petape b) ; et 

d) comparaison desdits premier et second spectres de correlation RMN ^N^H bidimensionnelie pour deter- 
miner les differences entre tesdits premier et second spectres, les differences identifiant la presence d'un ou 
plusieurs composes qui sont des ligands qui se sont lies a la molecule cible. 

20 

2. Procede selon la revendication 1 , dans lequel la molecule cible marquee avec 15 N est exposee a un melange de 
composes chimiques dans I'etape b), comprenant en outre les etapes suivant I'etape d) de 

e) exposition de la molecule cible marquee avec 15 N individuellement a chaque compose dudit melange, 

25 f) production d'un spectre de correlation RMN ^N^H bidimensionnelie de la molecule cible marquee qui a 

ete exposee individuellement a chaque compose ; et 

g) comparaison de chaque spectre produit dans Petape f ) audit premier spectre pour determiner les differences 
dans I'un quelconque de ces spectres compares, les differences identifiant la presence d'un compost qui est 
un ligand qui s'est lie a la molecule cible. 

30 

3. Procede selon la revendication 1 , dans lequel les differences dans les spectres de correlation RMN 15 N/ 1 H bidi- 
mensionnelie sont des deplacements chimiques a des sites marques avec 15 N particuliers dans la molecule cible 
et des deplacements chimiques dans des protons fixes a ces sites marques avec 15 N. 

35 4. Precede selon la revendication 1 , dans lequel la molecule cible est un polypeptide. 

5. Procede de determination de la constante de dissociation entre une molecule cible et un ligand qui se lie a cette 
molecule cible, comprenant les etapes de : 

40 a) production d'un premier spectre de correlation RMN 15 N/ 1 H bidimensionnelie d'une molecule cible marquee 

avec 15 N ; 

b) exposition de la molecule cible marquee a differentes concentrations d'un ligand; 

c) production d'un spectre de correlation RNM 15 N/ 1 H bidimensionelle a chaque concentration de ligand de 
I'etape b) ; 

45 d) comparaison de chaque spectre de I'etape c) au premier spectre de I'etape a) et a tous les autres spectres 

de Petape c) pour quantifier les differences dans ces spectres en fonction de modifications de la concentration 
de ligand ; et 

e) calcul de la constante de dissociation entre la molecule cible et le ligand d'apres ces differences, selon 
I'equation : 

so 

„ ([P] 0 -x)([L] 0 -x) 



55 OU 



[P] 0 est ta concentration molaire totale de la molecule cible ; 
[L] 0 est la concentration molaire totale de ligand, et 
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x est la concentration molaire de I'espece liee determinee selon ('equation : 

&obs ' hbre 
X = A 

ou 5 0bs et 5| jbre sont les valeurs de deplacement chimique pour la molecule cible determinees a chaque 
concentration de ligand et pour la molecule cible en I'absence de ligand, respectivement, et A est la difference 
entre le deplacement chimique a des quantites saturantes de iigand et 6, jbre . 

Precede selon la revendication 5, dans tequel la molecule cible est un polypeptide. 

Procede selon la revendication 5, comprenant en outre I'etape de liaison de la molecule cible marquee a un second 
ligand avant I'etape a). 
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